Algorithm Application Support Vector Machine with Genetic Algorithm Optimization Technique for Selection Features for the Analysis of Sentiment on Twitter

نویسنده

  • MOCHAMAD WAHYUDI
چکیده

Twitter has become one of the most popular micro-blogging platform, recently. Millions of users can share their thoughts and opinions about various aspects and activites. Therefore, twitter considered as a rich source of information for decision-making and sentiment analysis. In this case, the sentiment is aimed to overcome the problem of automatically classifying user tweets into positive opinion and negative opinion. The classifier Support Vector Machine (SVM) used in this study is a machine learning technique that is popular text classifiers, as Support Vector Machine (SVM) algorithm is one that has a linear calcification of the main principles for determining the linear separator in the search space that can best separate the two classes different. But the Support Vector Machine (SVM) has the disadvantage that the appropriate parameter selection problem. The tendency in recent years is to simultaneously optimize the features and parameters for Support Vector Machine (SVM), so as to improve the accuracy of classification on Support Vector Machine (SVM). Genetic Algorithm has the potential to produce better features and becomes optimal parameters at the same time. This research generate text classification in the form of positive and negative tweets on twitter. Measurement accuracy is based on Support Vector Machine (SVM) before and after using a Genetic Algorithm. Evaluation was performed using 10 fold cross validation while accuracy is measured by the confusion matrix and ROC curves. The results of the study showed an increase in accuracy of Support Vector Machine (SVM) from 63.50% to 93.50%.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Feature Selection Using Multi Objective Genetic Algorithm with Support Vector Machine

Different approaches have been proposed for feature selection to obtain suitable features subset among all features. These methods search feature space for feature subsets which satisfies some criteria or optimizes several objective functions. The objective functions are divided into two main groups: filter and wrapper methods.  In filter methods, features subsets are selected due to some measu...

متن کامل

Forecasting Stock Price Movements Based on Opinion Mining and Sentiment Analysis: An Application of Support Vector Machine and Twitter Data

Today, social networks are fast and dynamic communication intermediaries that are a vital business tool. This study aims at examining the views of those involved with Facebook stocks so that we can summarize their views to predict the general behavior of this stock and collectively consider possible Facebook stock price movements, and create a more accurate pattern compared to previous patterns...

متن کامل

Intelligent application for Heart disease detection using Hybrid Optimization algorithm

Prediction of heart disease is very important because it is one of the causes of death around the world. Moreover, heart disease prediction in the early stage plays a main role in the treatment and recovery disease and reduces costs of diagnosis disease and side effects it. Machine learning algorithms are able to identify an effective pattern for diagnosis and treatment of the disease and ident...

متن کامل

Application of Genetic Algorithm Based Support Vector Machine Model in Second Virial Coefficient Prediction of Pure Compounds

In this work, a Genetic Algorithm boosted Least Square Support Vector Machine model by a set of linear equations instead of a quadratic program, which is improved version of Support Vector Machine model, was used for estimation of 98 pure compounds second virial coefficient. Compounds were classified to the different groups. Finest parameters were obtained by Genetic Algorithm method ...

متن کامل

Heart Rate Variability Classification using Support Vector Machine and Genetic Algorithm

Background: Electrocardiogram (ECG) is defined as an electrical signal, which represents cardiac activity. Heart rate variability (HRV) as the variation of interval between two consecutive heartbeats represents the balance between the sympathetic and parasympathetic branches of the autonomic nervous system.Objective: In this study, we aimed to evaluate the efficiency of discrete wavelet transfo...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2016